Volc Engine Launches Doubao Speech Recognition Model 2.0 to Improve Multilingual Recognition Accuracy
Volc Engine has launched Doubao Speech Recognition Model 2.0, which significantly enhances inference capabilities and adds support for multilingual recognition and for recognizing visual information. The model is built on a 2-billion-parameter audio encoder and is optimized for complex scenarios, improving recognition accuracy for proper nouns, personal names, place names, and polyphonic characters.